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Abstract 

Packing and covering problems for metric spaces, and graphs in particular, are of essential 
interest in combinatorics and coding theory. They are formulated in terms of metric balls of 
vertices. We consider a new problem in graph theory which is also based on the consideration 
of metric balls of vertices, but which is distinct from the traditional packing and covering 
problems. This problem is motivated by applications in information transmission when 
redundancy of messages is not sufficient for their exact reconstruction, and applications 
in computational biology when one wishes to restore an evolutionary process. It can be 
defined as the reconstruction, or identification, of an unknown vertex in a given graph from 
a minimal number of vertices (erroneous or distorted patterns) in a metric ball of a given 
radius r around the unknown vertex. For this problem it is required to find minimum 
restrictions for such a reconstruction to be possible and also to find efficient reconstruction 
algorithms under such minimal restrictions. 

In this paper we define error graphs and investigate their basic properties. A particular 
class of error graphs occurs when the vertices of the graph are the elements of a group, 
and when the path metric is determined by a suitable set of group elements. These are the 
undirected Cayley graphs. Of particular interest is the transposition Cayley graph on the 
symmetric group which occurs in connection with the analysis of transpositional mutations 
in molecular biology |17l 119] . We obtain a complete solution of the above problems for the 
transposition Cayley graph on the symmetric group. 
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1 Introduction: A Graph-Theoretical Approach to 
Efficient Reconstruction 



The problem of the efficient reconstruction of sequences was introduced in [T2J [T2J [Tl] as a 
problem in coding theory, and similar questions about the efficient reconstruction of integer 
partitions were considered in [151 ITS] . In this paper we discuss a graph-theoretical setting in 
which efficient reconstruction problems can be studied as a uniform theory. 

Let r = (V, E) be a simple, undirected and connected graph with vertex set V and edge set 
E . We regard the vertices in V as units of information in the given reconstruction problem, 
and for two vertices x ^ y in V we regard {x, y} as an edge of T if y is obtained from x, or 
vice versa x from y , by a single error or single distortion of information. We might say that 
x and y are erroneous single error representations of each other, and that T is a single error 
graph. The precise definitions can be found in Section 2. The task of the reconstruction problem 
now is to restore or reconstruct the original unit of information from sufficiently many erroneous 
representations of it. In other words, an unknown vertex x in T is to be identified by suitable 
knowledge about its neighbouring vertices in T. 

We denote the path distance between two vertices x and y of T by d(x,y) and we let B r (x) — 
{ y G V : d(x, y) < r} be the ball of radius r centered at x. For given r > 1 denote by N(T, r) 
the largest number N such that there exist a set A C V of size N and two vertices x ^ y with 
A C B r (x) and A C B r (y). Thus any N + 1 distinct vertices are contained in B r (x) for at most 
one vertex x while there are some N vertices simultaneously contained in B r (x) and B r (y) 
for some x ^ y. This means that an unknown vertex of T can be identified, or reconstructed 
uniquely, by any set of A^r, r) + 1 or more distinct vertices at distance at most r from the 
vertex, provides that such a set exists. 

In graph theoretical terms we are therefore required, for an arbitrary graph T and an integer 
r > 1, to determine the number 



and to construct an efficient algorithm by which any unknown vertex x in V can be identified 
uniquely from an arbitrary set of iV(r, r) + I vertices at distance r or less from x. Evidently we 
can assume that r is at most d(T), the diameter of V. Throughout the paper we assume that 
d(T) > 2 and in particular \V\ > 3. 



Problems of this kind have been solved for some graphs and metric spaces of interest in coding 
theory, and to give an impression of such results we review the example of Hamming spaces and 
Johnson spaces. The Hamming space F™ consists of q n vectors of length n over the alphabet 
{0, 1, q — 1} with metric d(x, y) given by the number of coordinates in which the vectors x 
and y differ. This metric space can be represented by a graph T whose vertices are the vectors 
of i 7 "™ with two vectors connected by an edge if and only if they differ in a single coordinate. 
The path distance between two vertices then is the Hamming distance between the corresponding 
vectors. Therefore we can identify F™ with this graph V . In [TJ1 [T3J [TJ] it was shown that for 
any n, q and r we have 



N(T,r) 



max \B r (x) fl B r (y)\ 



(i) 




(2) 
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Furthermore, any x € F™ can be reconstructed from N = N(Fg,r) + l vectors of B r (x), written 
as the columns of a matrix, by applying the majority algorithm to the rows of the matrix. 

For any 1 < w < n — 1 the Johnson space J™ consists of the ("J binary vectors in F% of 
length n and Hamming weight w , where distance is equal to half the (even) Hamming distance 
in F£ ■ This distance coincides with the minimal number of coordinate transpositions needed to 
transform one vector into the other. The Johnson space then can be viewed as a graph T whose 
vertices are the vectors of J™ with two vectors connected by an edge if and only if one is obtained 
from the other by a transposition of two coordinates. The path distance between two vertices of 
r then is the Johnson distance between the corresponding vectors. Therefore we can identify J™ 
with this graph T . In |121 113] it was also shown that for any n, w and r we have 

Furthermore, any x £ J™ can be reconstructed from N — N(J™, r) + 1 vectors of B r (x), written 
as the columns of a matrix, by applying a threshold algorithm to the rows of the matrix. 

In the first part of this paper we make the notion of error graphs precise and develop the theory 
needed to estimate N(T,r) in some general situations. In this respect our main results are 
Theorems [1] and which give lower bounds for N(T, 1) and N(T, 2) in terms of other graph 
parameters. It may be useful to mention that the idea of reconstructing a vertex in a given graph 
has nothing to do, a priory, with the classical Ulam problem of reconstructing a graph from the 
isomorphism classes of its vertex-deleted subgraphs. So we do not refer to the well-known and 
unresolved vertex-reconstruction problem. Nevertheless, error graphs are such a general tool that 
even this problem can be phrased suitably a problem on error graphs. 

In the second part of the paper we deal with error graphs for which the vertex set consists of the 
elements of a group, and where the errors are defined by a certain set of group elements. Such 
graphs turn out to be undirected Cayley graphs, and in Sections 4 and 5 we show that many 
important error graphs occur as Cayley graphs. In Section 5 we discuss how transpositional 
errors in biological nucleotide sequences can be described as errors in the transposition Cayley 
graph Sym„(T) on the symmetric group. The remainder of the paper deals with this graph in 
particular. 

In Theorem 2] we determine the full automorphism group of the transposition Cayley graph 
Sym n (T). The explicit value of iV(Sym„(T), r) can be found in Theorems [5j |6] and [7] for 1 < 
r < 3. To state the main result on N (Sym n (T) , r) for arbitrary r > 1 let c(n,n — r) be the 
number of permutations on {l..n} having exactly n — r cycles. Thus the c(n,n — r) are the 
signless Stirling numbers of the first kind. We also need the following restricted Stirling numbers: 
Let 031(71, n — r) be the number of permutations g on {l..n} having exactly n — r cycles such 
that 1, 2 and 3 belong to the same cycle of g. The main result on N(Sym n (T), r) is Theorem[9] 
It shows that for all r > 1 

r-l 

^(Syni^T),?-) = ^2 c(n,n-i) 

+ c 3 i(n,n — r) +c 3 i(n, n— (r + 1)) . (4) 

for all sufficiently large n. Furthermore, the maximum iV(Sym n (T), r) = \B r (x) (~)B r (y)\ occurs 
for any x 7^ y for which x~ l y is a 3 -cycle on {l..n}. We mention the connection between this 
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theorem and the Poincare polynomial of Sym n (T). When T is an arbitrary finite graph and v 
a vertex of T let Ci denote the number of vertices at distance i from v. Then 

n r ,„ (t) := c * f 

is the Poincare polynomial of T at v. When this polynomial is independent of v we write simply 
Ilr(t). For the transposition Cayley graph Sym n (T) the Poincare polynomial is 

n-l 

n sym„(T) (*) = ^2 c(n,n - i)t l (5) 
i=0 

where the c(n,n—i) are the Stirling numbers appearing in (|4"]l. This shows that the reconstruction 
parameters iV(P, r) are related to important graph invariants. 

In this paper we have avoided technical terminology as far as possible in order to make this 
material accessible to non-specialists. For the same reasons we have added a few key references 
to texts in computing and computational biology. 



2 Errors in Graphs 

We will now fix the notation used for the remainder. Let T — (V, E) be a finite graph with 
vertex set V and edge set E. All edges are undirected and there are no multiple edges or loops. 
Let x, y be vertices. Then x and y are adjacent to each other if {x, y} is an edge. Further, 
d(x, y) denotes the usual graph distance between the vertices, that is the length of a shortest 
path from x to y. Put d(x, y) = oo if x and y are in different components. For i > we let 
Bi(x) := {y 6 V : d(x,y) < i} and Si(x) := {y £ V : d(x,y) = i} be the ball and sphere of 
radius i around x, respectively. 

We put ki(x) — \Si{x)\ and for y £ Si(x) we set 

Ci(x,y) := |{z e 5,-i(d) : d(z,y) = l}\, 
Oi(x,y) := \{z G Si(x) : d(z,y) = l}\, 

h(x,y) := \{z e S i+ i(x) : d(z,y) = l}\. (6) 

It is clear that bo(x,y) = ki(x), that ai(x,y) = ax{y,x) is the number of triangles over the 
vertices x and y, and that C2(x,y) is the number of common neighbours of x and y € S% (x). 
Let 

A = A(r) = max ai(x,y) 

x : y£V : d(x,y) — l 

pi = /i(r) = max c 2 (x,y). (7) 

x,y£V, d(x,y)—2 

Since \B r {x) (~l B r {y)\ > for x ^ y only if d(x 7 y) < 2r we have 

N(T,r) = max N s {T,r) (8) 

Ks<2r 
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where 



N s (T,r) = max \B r {x) n B r (y)\. 

x,y£V, d(x,y)=s 



(9) 



In particular, Ni(T, 1) = A + 2 and N 2 (T, 1) = \i so that 

AT(r,l) = max(A + 2,/i). (10) 

Finding or estimating the value iV(r, r) for graphs of interest in applications is the main aim of 
our investigation here. We note the following general bounds for N(T, r). 

Lemma 1 Suppose that x y are vertices in the connected graph T = (V, E) at distance 
s = d[x,y) from each other. Let r > be an integer. 

(i) If r > s then B r (x) ("1 B r {y) = B r - S (x) U [(B r (x) \ B r - S (x)) n B r (y)]. In particular, we have 
N s (T,r) > \B r -„(x)\ and 

N{T,r)>\B r ^{x)\ (11) 

for some x G V. 

(ii) If r < s then B r (x) n B r (y) = B r (y) n [B r (x) \ B s _ r _i(a;)] . 

Proof: One should think of B r {x) \ B r ^ s {x) as an annulus around x. (i) Starting on a path of 
length s from y to x any vertex in B r - S {x) can be reached by a further path of length at most 
r — s . The other statements arc immediate from this, (ii) This is an direct consequence of the 
triangle inequality. □ 

We set 

ki(T) = maxfci(i) and k(T) = ki(F). (12) 

Then T is regular of valency k (or k -regular) if all its vertices have constant valency k = k(T) . 
A k -regular graph is distance-regular if the numbers Ci(x,y) and bi(x,y) (and hence ai(x,y) = 
k — Ci(x,y) — bi(x,y)) do not depend on x £ V and y € Si(x) , for all i = 0, 1, <i(r) . A 
distance-regular graph of diameter 2 is strongly regular. A good reference to strongly regular 
graphs is Chapter 21 in [5T] or also [3]. In such a graph there are integers A and fi so that 
any pair of vertices x ^ y is simultaneously adjacent to exactly A vertices if {x, y} is an edge, 
and to exactly [i vertices if {x, y} is not an edge. Our use in (0 of the symbols A and fi is 
therefore a natural extension to graphs which are not strongly regular. 

Let Aut(r) be the automorphism group of T. If T is vertex-transitive (that is, for any two 
vertices in V there is an automorphism of T mapping one onto the other) then ki{x) = ki(T) 
is constant for all x G V and i . In particular, such a graph is regular. However, even for 
vertex-transitive graphs the Ci(x,y) and bi{x,y) usually depend on y 6 Si{x), and this can 
cause difficulties in finding N(T,r). This phenomenon can be observed already on relatively 
small graphs, see the Remark following Lemma SJ 

The Hamming and Johnson graphs are examples of error graphs in which two vertices x ^ y are 
joined by an edge if and only if there exists a single error (the substitution of a symbol or the 
transposition of two coordinates, respectively) which transforms x to y and there exists a single 
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error which transform y to x. This observation leads to a natural general theory of single errors 
which we began in |13j . For this we let V be a finite (or countable) set. A single error on V is 
an injection h : Vh V defined on a non-empty subset Vh C V so that h(x) ^ a; for all x £ Vh ■ 
A non-empty set H of single errors will be called a single error set, or just error set, provided 
the following two properties hold: 

(i) For each h £ H and x £ Vh there exists some g £ H so that h(x) £ V g and g(h{x)) = x, 
and 

(ii) For all distinct pairs x,y £ V there exist x = x\,x^, —,x m = y £ V and /ii,/&2, ■■■,h m -i £ 
H such that x^+i = hi(Xi), for i = 1, m — 1 . 

For such a set H we construct the error graph Th = (V,E) where E — {{x,h(x)} : x £ 
V and h £ H} . Note, by the conditions on H we see that Th has no loops and that all edges 
are undirected. The condition (ii) says that there is a path between any two vertices, and hence 
that Th is connected. Furthermore, the usual path distance d(x, y) on Th now measures the 
minimum number of single errors required to transform x to y or y to x . 

It is easily seen that every connected simple graph T can be represented as an error graph where 
we can assume in addition that the single error set consists of involutions (that is, partial maps 
h defined on suitable subsets of V such that hr 1 = h). For if c: E —> C N is an edge 

colouring of T then each fiber c _1 (i) with i = l,...,x defines a natural involutionary error hi 
which is obtained by interchanging the two end vertices of any edge coloured by i. In particular, 
every connected graph T is an error graph with at most x = Xe(J?) errors where Xe(X) is the 
edge-chromatic number of T. 

By Vizing's theorem [22] this minimum number (over all T) is equal to k(T) + 1 where k(T) 
is the maximum degree of T, as in (|12j) . It is a natural question to ask whether any connected 
simple graph T can be represented as an error graph Tjj for some error set H of cardinality 
fc(r). The answer is affirmative, see [13] . where it is also shown that the property (i) can in 
general not be replaced by a stronger property H = H^ 1 (meaning that h^ 1 £ H if h £ H). 

In the examples discussed before, the Hamming graph is an error graph when V = F™ and when 
H consists of the n(q — 1) non-zero vectors h £ F™ of Hamming weight 1, with action given 
by h{x) = h + x for x £ V . Also the Johnson graph J™ is of this form when we view V as 
the set of all w -element subsets of {l,..,n} and when H is the set of all («) transpositions 
(i,j) interchanging i and j in {l,..,n}, in their natural permutational action on V obtained 
by permuting the coordinates of vectors. In order to make sure that the single error property 
h(x) 7^ x holds for all vertices x £ Vh one has to restrict the domain of (i,j) to those sets which 
contain exactly one of i and j . 

Similarly, the insertion and deletion errors for finite sequences over an alphabet A can be de- 
scribed in this fashion as an infinite error graph. As vertex set we consider the set V = 
A° U A 1 U A 2 U ... U A n U ... of all finite words over A. As single error set we take H := 
{di, d 2 , .., d m , ...} U {ii(a), 12(a), i m (a), ■■■ : a £ A} where d m deletes the m th entry in 
any word of length > m while i m (a) inserts a as the m th entry in any sequence of length 
> m — 1 . As expected, the usual graph metric is indeed the Levenshtein error distance [11] for 
sequences. Situations where the model of undirected single error graphs is not applicable include 
asymmetric errors, some further comments can be found in |13j . 



6 



3 Some Bounds for Regular Graphs 



For the remainder we assume that T is a connected and regular graph on v > 4 vertices, with 
degree 2 < k — k(T) and parameters A = A(T) , fi = fJ,(T) . We have < A < k — 1, 1 < fi < k 
and the diameter of T is d(T) > 1. 

For some classes of regular and strongly regular graphs on v vertices we have iV(r, 1) = o(v) 
as v — > oo. The following strongly regular graphs are well known, see Chapter 21 in [5T] or [1]. 
The triangle graph T(m) is strongly regular with parameters v — m(m — l)/2, k = 2(to — 2), 
A = m — 2, /i = 4 and hence N(T(m),l) = to. The lattice graph L2{m) is strongly regular 
with parameters v = to 2 , k = 2(to — 1), A — to — 2, n = 2 and hence 7V(L2(to),1) = to. 
Meanwhile the Paley graphs P(q) ( q a prime congruent to 1 mod 4 ) is strongly regular with 
parameters v — q, k=(q — l)/2, A = (g-5)/4, /x = (g-l)/4 and hence N(P(q), 1) = (g+3)/4. 
The complement of a strongly regular graph T is also strongly regular (although not necessarily 
connected). This complementary graph T has parameters v(T) = v, k(T) = v — k — 1, A(r) = 
v - 2k - 2 + n, fi(T) = v - 2k + A, and hence N(T, 1) = v - 2k + max(/i, A). 

Let O l m = O m -k O m ★ ... ★ O m be the product of t copies of the empty graph on to vertices. 
This is the complete t -partite graph with v = tm, each part consisting of to vertices and edges 
connecting vertices from different parts in all possible ways. If t > 2 this graph is connected and 
strongly regular. 

The complete graph on v vertices is denoted by K v . We recall that a 1-factor of a graph is a 
collection of disjoint edges covering all vertices (a complete matching of the vertices of T ) . When 
v is even consider the graph obtained from K v by removing the edges of a 1-factor. This graph 
is strongly regular with parameters k ~ fi = v — 2 , X = v — 4 and coincides with 0\ with t = § ■ 
Conversely, if T is a regular of degree k = v — 2 then v is even and r = 0\ with t = | . When 
t = § then N(Ol, 1) = A + 2 = v - 2 = |(« + A). More generally we have: 

Theorem 1 Let r be a regular graph with k < v — 2. Then we have 

N(T,l)<±(v + \) (13) 

with equality if and only if k — X = v ~ k divides v and T is the strongly regular graph with 
m = k — X and v = tm. 

Proof: By dTUJ) we have N(T, 1) = max{A + 2, ^}. If T = O l m with v = tm then k = v - to, 
/i = v — to and A = v — 2m so that N(T, 1) = /x = + A). For the converse assume first that 
A = fe — 1. In this case (fTU|) implies that N(T, 1) = k + 1 which is not possible as A: + 1 is the 
cardinality of any single ball. Therefore A < k — 2 and from the assumptions in the theorem it 
follows that X < v — 4 or ^A + 2< ^v. Hence A + 2 < ^(v + A) with equality if and only if 
A = v — 4. In the latter case only k = v — 2 is possible and so we have the situation already 
discussed, T is with m~k — X — v — k = 2 and v — 2t. 

It is left to show that /i < | (v + A) and to find the conditions for equality. For a A: -regular graph 
(V, E) and a vertex cc in ^ we count the number of edges between Si{x) and S^x). This gives 

^2 (k-l-a 1 (x,y))= ^2 c 2{x,z), 

y£S!{x) zeS 2 (x) 



7 



see again the definitions in (0. This gives fc(fc — 1 — A) < nk%(x) and since k 2 (x) < v — k — 1 
we obtain 



k(k- 1 - A) < nk 2 {x) < fi{v- k- 1). (14) 
Since 1 < \i < fc we get fc — 1 — A<« — fc — 1 and hence /i < fc < ^(i> + A) as required. 



If /i = fc = I (f + A) then we have equalities in (fT4"]l . For a regular graph it is well-known that the 
inequalities in (114[) turn into equalities if and only if the graph is strongly regular, see for instance 
Problem 21A in [21]. So let T be strongly regular with /x = fc. Then any pair of distinct and 
non-adjacent vertices have the same fc neighbours. It follows that x = x 1 or x is not adjacent 
to x 1 defines an equivalence relation on the vertices of T, with all equivalence classes of size 
m := v — fc. Hence m divides v = tm and T = OL,. □ 



Theorem 2 (Linear Programming Bound) Let T be a regular graph of valency fc > 2. Then 



N 2 (T, 2) > M (k - 1 - i(/i - l)(AT(r, 1) - 2)) + 2. 



(15) 



We note that this rather general bound is quite sharp, see the comment following Theorem [5] 

Proof: There are two vertices x,x' e V with d(x,x') — 2 so that the set Y = {yx, of all 

vertices at distance 1 from both x and x' has \i > 1 elements. If /x = 1 then y\ has fc — 2 
neighbours other than x and x'. It follows that -/V 2 (r, 2) > |-E>2(aO H i?2(a;')l > 3 + fc — 2 and so 
(fl~5|) holds. Hence we assume that fi > 2. 

Let J7 = UiLi -^i(2/i)\{ a: ) a; '}- We show that the number of elements in U is at least /i (fc — 1 — i(/i — l)(7V(r, 1) — 2)) . 
For h = 1, /i let ?7(/i) be the vertices of U which belong to exactly h of the sets i?i(j/i), as 
i = l, fi. In particular, U = U(l) U ... U £/(//) is a partition and so 



\U\ = J2\U(h) 



h=l 



Next observe that the set {(z, Bi(y)) : y £Y and z € B%(y) U } has cardinality 



]T fc|tf(/i)| = M(fc-i) 



and the set {(z, {i?i(y), Bx{y')}) : y ^ y' eY and z e Si(y) n i?i(y') n U} has cardinality 




^=2 V 7 fe.j'lCY,!/^' 



The last inequality holds as y ^ y 1 implies \B\(y) n 5x(y')| < N(T, 1). 



Set u/j := for h = 1, and it := |?7|. To find a lower bound for u we minimize 



it — /i(fc — 1) 



lU2 - 2m 3 
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for the non-negative integers u%, • subject to the constraints 

ku h < - 1) 

ft =2 

and 



h=2 



2/ \2 



Let u* < u — — 1) be the required minimum. Then by the duality of linear programming, 
see for instance Section 7.5 in |16j . the value of u* maximizes 

- M (fc-l)m - Q (N(T,l)-2)n 2 
subject to n,i, Ti2 > and the dual constraints 

hni + (^j n 2 > h — \ for h = 2, fj, . 
Note that n\ = and n 2 = 1 satisfies the dual constraints for all fi > 2 and hence 

t*-M*-l)>u*>-Q W,i)-2) . 
Therefore u > /i (fc - 1 - - l)(JV(r, 1) - 2)) as required. □ 



Note for instance that N 2 (T, 2) > fc+l when fi = 1, N 2 (T, 2) > 2k when fi = 2 and iV(T, 1) = 2, 
and N 2 (T, 2) > 3fc - 4 when fi = 3 and iV(r, 1) = 3 . 



Corollary 1 Suppose that T is a regular graph of valency k with no triangle nor pentagons. If 

N 2 (T,2)>N 1 (T,2). 



fi > 2 and k > 1 + Oij then 



Proof: We have A = since V has no triangles so that N(T, 1) = fi by (JTUJ). Similarly, 
ATi(r, 2) = 2k as T contains no pentagons. Using ([15]) we get 

7V 2 (r,2)-2fc > p(k-l-~(ji-i)(N(r,l)-2)j+2-2k 

= fi fk - 1 - ^(fi - l){fj, - 2U +2 -2k 

= (M-2)(fc-l-Q) >0 

and this completes the proof. □ 
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4 Single error sets as group generators 



An important class of graphs associated to single error sets is obtained when the vertex set of the 
graph are the elements of a finite group. So we let G be a finite group and consider the elements 
of G = V as the vertices of the error graph T = Th for some error set H. The neutral element 
of G is denoted by e = ec and 1 = {eg} is the identity subgroup of G. We suppose that the 
single error set is determined as a subset H of G so that the action of errors on vertices is given 
by the group product. That is, if h £ H and ieG then h(x) := xh^ 1 . In this situation H is 
a single error set if and only if H does not contain ec and 

(i) H satisfies H = H- 1 {= { /i" 1 : h £ H }) , and 

(ii) H generates G as a group. 

The first condition is clear since there is some g in H with g(h(x)) = (xh )g 
= x for a vertex x in V if and only if g = h^ 1 belongs to H. The second condition is a 
restatement of the connectedness of the error graph. Note that we have set h(x) := xh , rather 
than h(x) := xh. This is advisable so that the multiplication of errors as elements of G agrees 
with the co-cattenation of the corresponding maps, (gh)(x) — x(gh)^ 1 = xh~ 1 g~ 1 — g(h(x)). 

As is well known, in this situation Th is the undirected Cayley graph on G for the generating 
set H, and H is the Cayley set for Th ■ Note conversely that every undirected Cayley graph 
can be viewed as a single error graph. 

In the following we review some of the theory of Cayley graphs from the viewpoint of single error 
graphs. Let H be a Cayley set in the finite group G with corresponding graph Tjj — (V, E) on 
the vertex set V = G and let Aut(F#) be the automorphism group of Th- We consider two basic 
kinds of automorphisms of Th ■ For each g in G the left-multiplication on V 1 with g : x i— > gx 
for x £ V, induces an automorphism of Th since g: {x, xh^ 1 } i— > {gx, gxh -1 } maps edges to 
edges. If we think of {a;, xh' 1 } as being labelled by h — {x' 1 (xh' 1 ), (hx~ 1 )x} = {h^ 1 , h}, 
the quotients of its end vertices, then {gx, gxh -1 } has the same label as {x, xh~ x }. Therefore 
left-multiplication by elements of G are automorphisms that preserves all edge labels. 

This action is transitive on vertices and only the identity element fixes any vertex. This is 
therefore the regular action of G on itself. This property characterizes Cayley graphs: T is the 
Cayley graph of some group if and only if T admits a group of automorphisms that acts regularly 
on its vertices, see for instance Chapter 6 in pQ. Note however that the graph usually does not 
determine the group. 

We now describe graph automorphisms that change edge labels. Let C be a group of automor- 
phisms of G as a group. For the action of /3 £ C we write /3: x i-> (3(x) and so (3(xy) = (3(x)/3(y) 
as j3 is an automorphism of the group structure. We will also require that C preserves H, in 
the sense that (3(h) £ H for all h £ H and (3 £ C. Then C is a group of automorphisms 
of Th since (3: {x, xh^ 1 } n- {(3(x), ^(xh^ 1 )} — {(3(x), (3(x)(3(h)~ x } maps edges to edges, as 
(3(h)- 1 = (3(h- r ) £ H. Note that the label of (3({x, xh^}) now is /3(h). 

The semi-direct product G C is the (abstract) group of all pairs (g, (3) with multiplication 
(g' , (3')(g, (3) = (g'(3'(g), (3' (3). It acts on the graph as automorphisms by setting 

(g, (3) : x H> g(3(x) for x £ V. 
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This gives an injective group homomorphism from G-C to AutT# so that we can regard G-C 
as a subgroup of AutT#. We collect these facts: 

Proposition 1 Let Th be the error graph on the group V = G with error set H. Then the 
left-multiplication of vertices by elements of G forms a group of automorphisms of Th which 
acts regularly on the vertex set V. If C is a group of automorphisms of G (as a group) such that 
(3(H) C H for all (3 in C then the semi-direct product G-C is contained in the automorphism 
group of T H - 

A common example of this situation occurs when we consider conjugation by group elements. 
Let b G G. Then conjugation by b is the automorphisms 

x i y bxb^ 1 =: x b for x G G 

and 

x G := {x b : be G} 

is the conjugacy class of x. In this case the error set H is invariant under conjugation if and 
only if H is a union of conjugacy classes. For C we then take the group C := G/Z where 
Z = Z(G) = {b G G : x b = x for all x in G} is the center of G. These are the inner 
automorphisms of G. So in this situation G-C is a group of automorphism of Th- In Chapter 5 
we will analyse this example further when G is the symmetric group Sym n on the set {l..n} and 
when H is the set of all transpositions on {l..n}. There we shall see that the full automorphism 
group of the error graph can be larger than G ■ C, even if C is the group of all automorphisms 
of G as a group. 

Another interesting example occurs when T is the Hamming graph. Here G is the vector space 
F™ where F q is the field of q elements and H is the set of all vectors of the shape (0, .., 0, a, 0, .., 0) 
with a^O. Then G acts on itself as a group of translations, that is, maps of the kind g : x t-» g+x 
for all x G F™. For C we can take the monomial subgroup C = (F* )™ • Sym n C GL(n, q) acting 
naturally as linear maps on V. More precisely, C is the group of all n x n matrices with exactly 
one element from the multiplicative group F* in each row and column. So here F™ ■ C is a 
group of affine linear maps on F™ that acts naturally as automorphisms on the Hamming graph 

r. 

Considering again the general case we let Th — (G, E) be an error graph with error set H. We 
have seen that any group automorphism (3 fixing H as a set induces an automorphism of Th- 
Evidently [3 also fixes the identity element e — ec in G. Assume therefore more generally that 
C is a group of automorphisms of Th which fixes e. For any x G V 

x c := {/?(*) : (3 G C} 

is the orbit of x under C. In order to analyze the parameters h(x), ai(x,y), bi(x,y) and 
Ci(x,y) note that Th is vertex transitive and therefore it suffices to consider the spheres with 
center ec ■ Hence we abbreviate all parameters, writing Si, Bi , fcj = |Sj|, a,(y) , h(y) and 
Ci(y) , suppressing the reference to x — ec in each case. In general these parameters still depend 
on y although automorphisms provide at least for some form of regularity: 

Proposition 2 Let Th be the error graph on the group V = G with error set H and suppose 
that C is a group of automorphisms of Th which fixes e — ea ■ Then for each i > the sphere 
Si = Si(e) is a union of C -orbits. 
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Further, suppose that y and y' belong to the same C -orbit and that r, t > 0. Then \S r DSt(y)\ — 
\S r n S t (y')\ and \B r n B t (y)\ = \B r n B t (y')\. In particular, a.i{y) = a,(y'), 6f(y) = 6,(y') and 
Ci{y) = Ci(y') /or all i > 0. 



Proof: Let y £ Si and let e, t/i, j/i = ?/ be a shortest path from e to y. If /? G C then it 
is clear that /3(e) = e, (3(yi), /3(j/i) = /3(y) is a shortest path from e to It follows that 

f3(Si) = Si is a union of C-orbits. Now suppose that y' = (3(y). Then (3 (Si(y)) = Si (/3(y)) = 
Si(y') and so \S r nS t (y)\ = \/3 (S r n S t (y))\ = \P (S r )nj3 {S t (y)) \ = \S r nS t (y')\. The remainder 
follows immediately, including the statement on <ij, bi and Ci since these numbers are of the 
shape \S r (~l *S*t (2/) I f° r particular choices of r and t. □ 

If H is the single error set of Th we set H° := {eg} and H l := HH 1 ^ 1 inductively for i > 0. 
Clearly, r# is regular of degree fc(T) = \H\ . If as before Si denotes the sphere of radius i 
around e — ea then evidently S\ — H 1 — H, S2 = H 2 \ (H 1 U H°) and more generally, 

S, t = H i \ (H*- 1 U H 1 - 2 U ... U H 1 U 

The following is easily shown and gives the value of N(Th, 1) by using (JTUJ). 

Lemma 2 In an error graph Fh with error set H we have 

A(T#) = max I {(h, h') : x = hh' with h, h! £ H} | and 

fi(T H ) = max I {(h, h!) : x = hh' with h,h' £ H} \ . 

xeS2 



5 Permutations distorted by transpositional errors 

In the following we consider Cayley graphs when G = Sym„ is the symmetric group acting on 
the set {l..n}. Any subset H of G which generates G with e £" H and H = H^ 1 is a Cayley 
set for G. 

We express permutations in the usual cycle notation. (Throughout the word 'cycle' always refers 
to a particular kind of permutation, and never to a graph or subgraph.) A transposition on 
{l..n} is a permutation of the shape x — with 1 < i ^ j < n if we suppress the 1 -cycles 

of x . Particularly important graphs occur when H — {(1, 2), (2, 3), (n — 1, n)} are the n — 1 
Coxeter generators of the symmetric group. These form a minimal set of transpositions needed 
to generate Sym n . This set corresponds to the fundamental reflections associated to a chamber 
for the A -type Dynkin diagram. The chambers give rise to a triangulation of the euclidean 
unit sphere in K n-1 . In this situation the graph distance function d(x,y) in Th is a discretized 
version of the geodetic distance on this sphere and presents the distance between two facets in the 
triangulation of the sphere, see for instance the book [2] of Grove and Benson on Unite reflection 
groups. In this interpretation B r (x) is the 'cap' of facets on the sphere at distance < r from 
the facet x and N(T,r) is the number facets common to two such caps, with suitable distinct 
centers. Note also that here d(e, — ) evaluated for a single variable is the word length function 
in the corresponding Weyl group. This Cayley graph is of considerable importance in Lie theory 
and in many other parts of mathematics and physics. For a recent treatment of its combinatorics 
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we refer to [3]- We add that in computer science this graph is known as the bubble-sort Cayley 
graph and is used as a model for interconnection networks [5J 15] ■ Various other Cayley graphs on 
Sym n have been considered in the literature, we mention in particular Diaconis' book [5] where 
metrics on Sym n more generally are discussed. 

By contrast we may consider the error graph on Sym„ when the single error set H consists 
of all transpositions on {l..n}. This clearly is a highly redundant system of generators, 

situated at the other extreme to the case of the Coxeter elements in Sym n which form a minimal 
generating set. In this situation a single error (i, j) transforms the vertex x to its neighbour 
x(i,j) and all choices for 1 < i ^ j < n are admissible. A graph Th of this type will be called 
a transposition Cayley graph, and these graphs are the subject of the remainder of the paper. 

It may be useful to describe errors of this kind in a slightly more general setting. Let A 
be a finite alphabet with |^4| > 2 and let A n be the set of all words of length n over A. 
Then the single transposition error (i, j) on the coordinates of A n is the map (i, j) : a = 
(cii,..., a;, cij, a n ) i — y a^'i' = (oi,..., dj, a^, a„) with all other entries of a un- 
changed. This gives rise to an error distance g?a on A n where oIa{cl, b) is the least number of 
single transposition errors needed to transform a to b, if this is possible. In this case we must 
have b = a 9 for some g £ Sym„ and c?yi(a, b) < d{ec, g) where the latter denotes the distance 
in the transposition Cayley graph. (Observe that eU(a, a*- 1 '^) = if and only if at — Oj while 
d(ea, (i, j)) = 1 independently.) Note that this distance function defines a graph on A n . Each 
component is an error graph with involutory errors (i, j) if we restrict the domain of the single 
error (i, j) to the words a in which a; ^ a,-. In this way the transposition error graph Th can 
be said to control the transposition errors on A n . 

In molecular biology transpositional errors are one of the three known mechanisms in the mutation 
and evolution of genetic information. The so-called replication slippage applied to a nucleotide 
sequence is a process that results in some strings of consecutive nucleotides being reversed or 
repeated in the sequence. Such replication slippages usually recur and give rise to so-called mi- 
crosatellites which contain a high degree of information about the evolutionary process undergone 
by the nucleotide sequence in question, and often this happens in the non-coding part of the nu- 
cleotide sequence. For general information see Futuyma's book [7] on evolutionary biology as 
well as [T7] and [TO], 

Replication slippage is therefore a combinations of two kinds of errors on sequences, on the one 
hand the insertion-deletion process already mentioned at the end of Section 2 and the trans- 
positional errors in the transposition Cayley graph on the other. It may be worth to mention 
that the other principal mutation mechanisms are point mutations referring to the replacement 
of one nucleotide by another, and frame shifts which are the insertion or deletion of a group of 
nucleotides. Both of these are therefore covered by the insertion-deletion process. 

Evidently any interval transposition or reversal (of a part of a nucleotide sequence) can be ex- 
pressed as a product of single transpositional errors. However, it should be interesting to introduce 
such products as new single errors, and to consider the resulting error graph on Sym n . A second 
point of interest should be to study the resistance to transpositional errors: As the nucleotide 
alphabet consists of just four letters, a single transpositional error is expressed only in a small 
proportion of all possible words in A™, leaving many others unchanged by that error. 

Returning to the general discussion of the transposition Cayley graph we note the following 
conventions. Permutations in Sym n are multiplied from the right so that (xy)(j) — x(y(j)) for 
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all x, y € Sym n and j £ {l..n}. If x is written as a product of hi disjoint cycles of length i 
for 1 < i < n then the cycle type of x is denoted as ct(x) = l hl 2 2 ...n n . Here it is essential 
to include 1 -cycles so that J2i ihi = n - As * s well-known, two permutations are conjugate to 
each other through an element of Sym„ if and only if they have the same cycle type. Writing 
G = Sym n therefore the conjugacy class 

(l h ^2 h2 . .. n hn ) a :=x G = {g- 1 xs : geG} 

is the set of all permutations having the same cycle type as x . 

We let H = T := { e Sym„ : 1 < i ^ j < n} = (l n ~ 2 2 r ) G be the set of all transpositions of 
{l..n} . Thus Tt is the transposition Cayley graph on Sym„ and will be denoted by Sym„(T). 
The following collects some easily established facts. 

Lemma 3 For n > 3 the transposition Cayley graph Sym n (T) is a connected (^) -regular graph 
of order n\ and diameter n — 1. It is t -partite for any 2 < t < n. 

Proof: The group Sym n has order n\ and is generated by its (™) transpositions. Its diameter 
is at most n— 1 since any permutation is a product of at most n — 1 transpositions. On the other 
hand, an n -cycle can not be written in terms of fewer than n — 1 transpositions. No two elements 
in the same sphere Si could be adjacent to each other as they have the same determinant (— l) 1 . 
Hence So, Si, S2, S„_i is a partition into n parts from which a t -partition can be obtained 
for any t < n . □ 

For the product of a permutation with a transposition the following simple rule is essential. If 
x = (ii, .., ifc)(j'i, .., je) consists of two disjoint cycles and if t = (i,j) interchanges elements from 
different cycles, say i\ = i and ji = j without loss of generality as the cycles are determined 
only up to cyclic reordering, then 

xt = (ii,j2,j3,»,3tJi>i2,i3,»,ik) =■ s (16) 

is a single cycle obtained by joining up the two cycles of x . Conversely, upon multiplying this 
equation again by t , we see that multiplying the single cycle s by a transposition of some two 
elements from that cycle gives x — xtt = st , hence splitting that single cycle into two cycles. 
Therefore multiplying any permutation x by a transposition results in a permutation which either 
joins up two cycles of x or splits one cycle of x into two, with no other changes. 

Following the earlier convention whereby Si = Sj(e) we have that H = T = Si consists of all 
transpositions, S2 consists of all 3-cycles k) and all double transpositions (i,j)(k,£) with 
i, j, k, I distinct, and so on. As multiplication by a transposition increases or decreases the 
number of cycles by one it follows by induction that Si consists of all permutations expressible 
as a product of n — i disjoint cycles, counting also all 1 -cycles. 

The path distance between two permutations x and y is the least number d of transpositions 
ti such that xt\...td — y ■ Equivalently d is the least number of transpositions needed to write 
x~ x y and also equal to the number of bisections and gluings needed to transform the cycles of 
x into those of y . The number of distinct paths from x to y is equal to the number of paths 
from e to x _1 y and about these the following theorem gives complete information. It is based 
on Ore's theorem on the number of trees with n labeled vertices, see also Theorem 2 in |10) . 
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Theorem 3 Suppose that x has cycle type ct(x) = 1 1 2" 2 ...n n and let 1 < i < n — 1 be 
such that Y]j—i hj — n — i is the number of cycles in x. Then the number of distinct ways to 
express x as a product of i transpositions is equal to 

n / nJ- 2 \ hj 



By the discussion above x cannot be written in fewer than i transpositions. The special case 
i = n — 1 and h n = 1 means that each of the (n — 1)! cycles of length n has n n ~ 2 different 
representations as a product of n — 1 transpositions. This number coincides with the number of 
trees with n labelled vertices, see also Section 5.3 in Stanley [20] . 

Let 1 < i < n — 1. If y € Si has cycle type ct(y) = l hl 2 h2 ..n hn and consists of Xw=i = n — i 
cycles then y is a product of i transpositions. As dety = (— l) 4 we must have ai(y) =0. As a 
single cycle of length j can be split into two cycles as in (TTB1) in different ways, we have Ci(y) = 

E"=i (i) h 3- From SjU = n [t f°ll° ws that Cf(y) = X)"=i (2)^ = | fe"=i J 2/l j - n ) ■ 
If we regard y as an element of Sym TO with m > n then it is clear from (|16[) that Ci(y) is 
independent of n. Finally, h(y) — (™) — Ci(y) . We collect these facts: 



Lemma 4 In Sym„(T) i/ie set Si , where 1 < i < n — 1, consists of all permutations of {l..n} 
which are composed of exactly n — i disjoint cycles, including 1 -cycles. 

If y € Si has cycle type ct(y) = l hl 2 h2 .. n hn then 



a i(y) = and 



2 . 

// y is regarded as an element in Sym m with m > n then only bi{y) depends on n. 



Remark: Loosely speaking, if y belongs to Si we can think of Ci(y) as the 'downward' degree 
of y, namely the number of neighbours of y in the next lower sphere Sj_i. The fact that this 
degree is independent of n will be used later on. Similarly (™) — Ci(y) is the 'upward' degree 
of y. The transposition Cayley graphs are not distance-regular and they illustrate the fact that 
the up- and downward degrees are not constant for elements in the same sphere. This can be 
seen already in Sym 4 (T). If y in S2 is a 3 -cycle then c 2 (y) = 3 according to the three choice 
of a transposition splitting the 3-cycle. On the other hand, if y = (1,2)(3,4) in S2 is a double 
transposition then C2(y) — 2 as there are just two ways to split one of the two cycles. This is 
true for any n > 4. 



Next we discuss the automorphism group of the transposition Cayley graph. As before let G = 
Sym n = V and set F = Sym ra (T). Let (a, 6) be an element of the direct product G x G . Then 
(a, 6) : x i-> axb^ 1 for x € V is an automorphism of F since for any transposition t we have 
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xt i-> axtb^ 1 = (axb^ 1 )(btb^ 1 ) in which btb -1 again is a transposition. Note that only the 
identity of G x G fixes all vertices since axb~ x = x for all x S implies that a = b and hence 
that a £ Z(G) = 1. This implies that we can view G x G as a subgroup of Aut(r). Recall the 
discussion in Section 4. If we let C be the group of conjugation automorphisms, x <—> x b for 
all b G G, then C is the diagonal subgroup {(b,b) : b G G} C G x G. Furthermore, we have 
G x G = G-C as subgroups of Aut(r). 

A further automorphism of V comes from the inversion map 

i: x O a; -1 for x G V. 

While z is not an automorphism of the group it is an automorphism of the graph. For if {x, y} is 
an edge with y = xt and t a transposition then y" 1 = x~ 1 (yty~ 1 ) where yty~ x is a transposition 
and so {y~ l ,x~ 1 } is an edge. Since (i(a,b)i) (x) — (ax~ 1 b~ 1 )~ 1 = (b,a)(x) for all x G V we see 
that i normalizes G x G by interchanging the two direct factors. This shows that the semi-direct 
product (Sym ra x Sym n ) • (i) is contained in Aut(Sym n (T)) . 

Theorem 4 For n > 3 the full automorphism group of Sym Tl (T) is the semi-direct product 
(Sym n x Sym n ) • G2 where Ci = (1) is the group of order 2 obtained by inverting the elements 
in V = Sym n . 

Proof: As before set G = Sym„, T = Sym„(T) and let A be the group of all automorphisms 
of r. When n = 3 when T is the complete bipartite graph ^3 3 and in this case the statement 
can be checked directly from the description of the action of (Sym 3 x Sym 3 ) • (1) on T . 

Now suppose that n > 3 and let a" G A. As G acts vertex transitively by left-multiplication we 
select g" G (G x 1) C (G x G) such that a' := g" a" fixes cq . This implies that a' fixes S r as 
a set, for all r > 1, see Proposition [51 Furthermore, a' fixes each of the two conjugacy classes 
(1™ _3 3 1 ) G and (1™ _4 2 2 ) G in S% since C2 = 3 on the first class while C2 = 2 on the second class, 
see the remark following Lemma [4] 

For 1 < i < n let the pencil Pi be the set Pi = { (i,j) G Si : 1 < j < n and i ^ j}. 
Then the following holds: any pair x ^ y G Pi has exactly two joint neighbours in (1" _3 3 1 ) G , 
and Pi is a maximal subset of Si with this property. Conversely, any set of n — 1 vertices 
in Si satisfying this property is a pencil. Since (l n_3 3 1 ) G is invariant under a' we see that 
a' (Pi) again is a pencil. On the other hand, the diagonal element (g,g) G G x G satisfies 
(9,9)(i,j) = 9 ■ ■ g^ 1 = (g(i),9(j)) so that (g,g)(Pi) is the pencil P gW . This means that 
the diagonal group induces the full symmetric group on pencils while fixing ec ■ In particular, 
we can find some g' = (g,g) G G x G such that a := g'a' fixes each pencil as a set, in 
addition to the vertex cq. Let x = (i,j) be an element of Si. Then {x} = Pi D Pj so that 
{a(x)} = a(Pi) fl a(Pj) = {x}. Hence a fixes all elements in Bi(ea) pointwise. 

Note that (1,2), (1,3) and (2,3) are pairwise joined to two elements in S2 , and no others, 
namely x = (1,2,3) and y = (1,3,2). Thus a fixes {x, y} as a set and if 1 denotes the 
inversion automorphism mentioned before, then either a or ict fixes all of Bi(ec) U {(1,2,3)} 
pointwise. By the following lemma either ig'g"a" or g'g"a" is the identity automorphism of 
T and so a" = g"~ 1 g'~ 1 i or a" = g"~ 1 g'~ 1 belongs to (Sym„ x Sym„) • G 2 . □ 
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Lemma 5 For n > 3 only the identity automorphism of Sym n (T) fixes every vertex in Biiea)^ 
{(1,2,3)}. 



Proof: This is evident for n = 3. Suppose therefore that n > 4 and that a is an automorphism 
fixing every vertex in B\(ea) U {(1, 2, 3)}. 

Then each double transposition in (l n_4 2 2 ) G is fixed by a as these elements have exactly 
two neighbours in Si, with no two double transpositions having the same Si -neighbours. The 
elements in (l™" 3 ^ 1 ) fall into pairs [(i,j,k), (i,k,j)] of 3-cycles, each pairwise linked to the 
three fixed elements (j,k) and (i,k) in Si. Therefore a either fixes or interchanges the 

members in each pair. We show that a fixes these elements and hence is the identity on S2 ■ 

Evidently (1,2,3) and (1,3,2) are both fixed. Hence look at the three pairs [(1, 4, 2), (1, 2, 4)] , 
[(1, 3, 4), (1, 4, 3)] and [(2, 3, 4), (2, 4, 3)]. As can be calculated, the six 4 -cycles in S3 involving 
1, 2, 3 and 4 are partitioned into two sets X , all connected to (1, 2, 3), and Y, all connected to 
(1,3,2). The sets X and Y are therefore fixed by a as sets. It turns out that (1,4,2) is linked 
to two vertices in X while (1, 2, 4) is linked to two vertices in Y . This means that (1, 4, 2) and 
(1,2,4) are each fixed by a. The same argument extends to all other 3-cycles. Hence ^(cg) 
is fixed pointwise. For the remainder the argument becomes more homogeneous. Suppose that 
x and y = a[x) are in S r with r > 2. By induction we can assume that a fixes all vertices in 
Sr—i and this means that x and y have the same neighbours N(x) = N(y) in S r —i. We claim 
that this forces x — y. The elements in N(x) are obtained by 'splitting' any cycles appearing in 
x into two cycles in all possible ways, see (|16p . In particular, x and y have the same orbits on 
{l..n} and if there are at least two orbits of length > 1 then N(x) — N(y) forces x = y. In the 
remaining case x and y consist of a single cycle of length t > 4 with all other vertices fixed. It 
is easy to see that I > 3 and N(x) — N(y) again forces x = y. □ 



6 Distance statistics in the transposition graph 

Let Si be the sphere of radius i < n — 1 and centre ec in the transposition Cayley graph 
Sym n (T). Then Si is a union of Sym„ -conjugacy classes and the parameters a,i(y) , Ci(y) and 
bi(y) are constant on these classes, for all 1 < i < n — 1. It will be useful to set Sj(n) := |iSj|, 
and in more customary symbols, c(n, n — i) := \Si\. 

Then c(n, n — i) is the number of permutations in Sym Tl having n — i cycles, for 1 < i < n — 1, 
and these are the signless Stirling numbers of the first kind, see for instance Chapter 1.3 in 
Stanley's book [20]. We have c(n,n) = 1, c(n,n - 1) = (™) , c(n,n - 2) = 2(3) + 3(£), 
c(n, n — 3) = 3(") + 20Q) + 15(g) and so on, up to c(n, 1) = (n — 1)1 . The generating function 
of c(n, to) satisfies 



: = ^2 c(n,m)t 



t(t + l)---(t + n-l) 



m— 1 

and from this we get the product form 



n Sym „(T) = = (1 



+ 



t)(l + 2t).--(l + (n-l)t) 
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for the Poincare polynomial ([5]) of Sym n (T). From the definition it is clear that Si(n) is a 
polynomial in n when i is fixed. The leading term counts the number of permutations of cycle 
type \ n ~ 2% 2 % and so we note: 



Lemma 6 If i is fixed and n > 2i then Sjfn) is a polynomial in n of degree 2i. Its leading 
term is the leading term of TrfoM":^ ) ' ' ' (" 2* +2 ) an ^ * s e 1 ua ^ t° TErr™ 21 - 



For y € Si with cycle type ct(y) = l hl 2 h2 ...n h " let as before (l hl 2 h2 ... n hn ) G = y G be the 
conjugacy class of y . Then 

2 h \..n h "f* 



l h ^hi\2 h2 h 2 l---n h »h n \ ' 
and 



*= u 

hi+h^-i \-h„=n- 



h\ cyh 2 



(V 11 2 



(18) 



see again Chapter 1.3 in [20]. Omitting cycle types of multiplicity we therefore have Si — 
(l n - 2 2 1 ) G , S 2 = (l n_3 3 1 ) G U (1"- 4 2 2 ) G and so on. For small values of r one can compute 
-/V(Sym n (T), r) easily from this information. 



6.1 The value of N(Sym n (T),r) for r < 3 

As we have observed, Sym„(T) is not distance- regular and as a consequence it is not straightfor- 
ward to determine the value N(Sym n (T),r) for general r. We begin to evaluate iV(Sym„(T), r) 
for r < 3 when closed formulae can be obtained. 

Theorem 5 For n > 3 we have 

JV(Sym n (T),l)=3. (19) 



Proof: From Lemma [4] we have A(Sym n (T)) = since ai{z) = for z € Si and moreover 
C2(y) = 3 if y has cycle type ct(y) = l™~ 3 3 1 and C2(y) = 2 if y has cycle type ct(y) = l™~ 4 2 2 . 
Therefore, from © we have ^(Sym„(T)) = 3 and by HTDJl we get (fI5]l. □ 



Theorem 6 For n > 5 we have 

N(Sym n (T), 2) = N 2 (Sym n (T), 2) = ^(n + l)(n - 2). (20) 
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Remark: From this result one can see that the bound in Theorem [2] is indeed very good: 
Working out the parameters for the transposition Cayley graph gives the bound N(Sym n (T), 2) > 
iV 2 (Sym„(T),2) > |(n + l)(n - 2) - 1 from Theorem 

Proof: By vertex transitivity it suffices to compute |i?2 H i?2 (y) | with Bi = i?2(e). This quantity 
depends only on the conjugacy class to which y belongs, this is a consequence of Proposition [5] 
and Theorem |4l Therefore we need to consider the number N(y) of all vertices in B% which are 
at distance < 2 from a given vertex y G Si when i runs from 1 to 4. By ()18[) we have 

S 4 = (l n - 5 5 1 ) G U (1"- 6 2 1 4 1 ) G U (l n ~ 6 3 2 ) G U (1™- 7 2 2 3 1 ) G U (l n ~ 8 2 4 ) G , 
S 3 = (l n ~ 4 4 1 ) U (l n " 5 2 1 3 1 ) 6 U (I™" 6 2 3 ) G 

and so on. The numbers N(y) are presented in Table 1. The row index is the conjugacy class 
which contains y while the column index is the conjugacy classes contained in Bi- The value of 
N(y) is worked out using (IT51) . 



N(y) 








(1™) U 




10 


10 








(l n - 6 2 1 4 1 ) G 


4 


6 








(1™- 6 3 2 ) G 


2 


9 








(1"- ? 2 2 3 1 ) G 


1 


7 








(1™-«2 4 ) G 





6 








(in-4 4 l)U 


4 


2 


6 





( 1 n-& 2 1 3 1 ) U 


1 


3 


4 





(l n - B 2 3 ) G 





3 


3 





(1 „-3 3 1)G 


6(ra - 3) + 2 


3(V) 


3 


1 


(l n ~ 4 2 2 ) G 


4(n - 2) 


2(V)~1 


2 


1 


(l n - 2 2 1 ) G 


2(n - 2) 


(V) 


(?) 


1 



Table 1 



When we consider the corresponding rows in the table we get iV4(Sym n (T), 2) = 20 when n > 5, 
iV 3 (Sym n (T),2) = 12 when n > 4, iV 2 (Sym„(T), 2) = §(n+ l)(n - 2) and jV^SymjT), 2) = 
(n — l)n for all n > 3. This proves the theorem due to |5]). □ 



To estimate the number of vertices in |B r n B r (y)\ for an arbitrary y we consider the paths 
t\ti • • ■ t r * with r* < r starting at y and leading to a vertex z = yti<2 ■ ■ ■ tr* belonging to 
B r . We say that this path has a descent at step k < r* if ytit2 • ■ -£fe-i £ <S S for some s 
while yiit2 • • - tk-itk G S s —i- The number of ways to continue the path at 2/ti<2 ■ • l by a 
descent is the downward degree c(ec 2/^1^2 • ■ - ife-i) of ([6]) which by Lemma [4] is independent 
of n. Similarly, we say that the path has an ascent at step k if yijia * • * ^fc— 1 S while 
J/^1^2 • 4 • tk-itk G iSs+i- In this case the number of choices to continue the path at yt\ti ■ ■ ■ tk—i 
by an ascent is the upward degree b(ec, yt\t2 • ■ • tk-i) which by LemmaU] is of order n 2 . Hence 

Lemma 7 The number of vertices z = yt\ti ■ ■ - t r * reachable from y on a path with a ascents 
is at most k y n 2a where k y is some constant independent of n. 
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Let .E^i+i be the set of edges joining a vertex in Si to one in Si+\. As Sym n (T) is fc-regular 
with k = \Si\ and as ai(z) = for all z G Si, see Lemma[U we have l-E^-i^l + \Ei,i + i\ = k ■ \Si\ 
and hence 

\E r _ 1>r \ = k ■ (\s r -x\ - \s r - 2 \ + \s r - 3 \ - ... + (-lf- 1 !^!) (21) 

for all r. For the transposition y G T let E r -i ir (y) be the set of all edges in E r _i >r of the form 
{z, zy} . (These arc the edges in E r —±, r that are labelled by y .) Evidently all automorphisms of 
Sym ra (T) fixing ea permute E r -i lT as a set and since the conjugation action is transitive on T 
every edge label must appear an equal number of times in each orbit. Hence 

\E r _ ltr (y)\ = |5 r _i| - |S r _ 2 | + |S r _ 3 | - ... + (-ly-'lSol (22) 

for all r. If y = (Ji,j 2 ) then the end vertices v~ G S r -i and v + G S r of {v~ ,v + } G E r -i jr (y) 
are composed of cycles in which ji and j 2 occur in different, respectively the same, cycle(s). 
Hence (|2"2"|) gives the number of permutations in S r _i with ji, ji in different cycles and, at the 
same time, the number of permutations in S r with j\, j% in the same cycle. 



Theorem 7 Let T = Sym ra (T) be the transposition Cayley graph and suppose that n > 4. 
Then we have 

(i) N X (T, 3) = 2|5 |+2|S , 2 | and 
(«) N 2 (T,S) = |5 | + |5i| + |5 2 | + (n + 2)(n-3) + 
„ , fn — 3\ nn fn — 3\ „ /n — 3 
+ 24 ( 2 j +22 ( 3 j +6 ( 4 

Furthermore we have N(T, 3) = iVa(r, 3) for all n > 16. 



Proof: We need to compute \B 3 n B^{y)\ when e = and j/ have distance d{e,y) < 6 from 
each other. When die, y) = 5 or 6 then a path of length < 3 from y to a vertex in B3 can not 
have any ascents. Therefore the number of such vertices is independent of n by Lemma [7] (The 
same phenomenon can be seen in the upper part of Table 1.) When d(e,y) — 3 or 4 then the 
corresponding paths have at most one ascent so that I-B3 (~1 Bs(y)\ is of order at most n 2 . When 
d(e,y) — 1 or 2 then \B 3 D S3 (t/) | is of order at least n A as we will show now. It follows that 
the cases 3 < d(e, y) < 6 can be ignored for large enough n, and a lower bound for n for this to 
be true will be given at the end of this proof. 

(i) Finding Ni (Sym„ (T) , 3) : Let y be in Si. Then by Lemma □ we have \B 3 n S 3 (y)| = 
|£?2|+Af(y) where M(y) is the number of vertices z 6 S3 with d(z,y) < 3, and hence d(z,y) = 2. 
If z = ytit 2 G 53 with transpositions ti then also z" 1 = ^2^12/ G S3, see Theorem 2) Hence 
M(y) is the number of all t 2 tiy belonging to S3. As any element in S 2 is of the shape £2*1 f° r 
some h and t 2 we see that M(y) = \E 2i3 (y)\ = \S 2 \ - \Si\ + \S \ by (p2|l. Therefore 

iVi(Sym n (T),3) = |B 2 | + | 1 S , a|-|5i| + | 1 S , o| = 2|5 a | + 2|So|. (23) 



(ii) Finding A^2(Sym ra (T), 3) : Let y = yiy 2 be in S2 with transpositions yi. By Lemma [T] 
we have \B 3 (~l i?3 (y) | = |£?i| + M(y) where M{y) is the number of vertices z G S2 U S3 with 
d(z,y) < 3. As above, if z — yt\---t r * G S 2 U S3 with r* < 3, consider instead z~ 1 
/ ... • • - t\y 2 y\ G S*2 U 63, that is, all paths from eg of length < 5 ending in 2/22/1 at a vertex in 
5*2 U S3. Let Z be the set of all such vertices z _1 , in particular then M(y) = \Z\. 
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Let u = tit\ G S2 be arbitrary. If t\ = y± then u = (£22/2)2/22/1 G Z C\ S2- Otherwise uyi = 
(^2^12/2)2/22/1 G ZnSjj. Denote the vertices in Z of this kind by Zo, in particular then |i?o| = IS^I- 

Next let e = {v + ,v~} 6 £2,3(2/1) with tj + = G 53 and v~ = v + y\ G S%. Then u~ belongs 
to Z if and only if w + j/2 belongs to Let the vertices of this type be denoted by Z\. Thus \Z\\ 
is the number of u = v + G S3 such that both uyi and uy2 belong to S2. When j/i = (1,2) and 
2/2 = (2, 3) then (16]) implies that \Z 1 | is the number of elements in (1, 2, 3)(4, 5) G or (1, 2, 3, 4) G 
with 1, 2, 3 in the same cycle. This number is 

\Zi\ = 2 ( n 2 3 ) +6(n-3) = (n + 2)(n-3) . (24) 

When ?/i = (1,2) and y 2 = (3,4) then \Z\\ is the number of elements in (1, 2, 3)(4, 5) G , 
(1,2, 3,4) G or (1, 2)(3, 4)(5, 6) G in which 1, 2 and 3, 4 appear in the same cycle(s). This number 
is 

|Z 1 |=4(n-4) + 6+( n 2 4 ) = Q . (25) 



Finally let e = {v + ,v~} be in £3.4(2/1) with w + G S4 and v~ G 5*3. Then v~ belongs to Z if 
and only if u — v + G ^4 has the property that both uyi and 1*2/2 belong to S3. Let Z2 be the 
set of all such vertices v~ . When j/i = (1, 2) and j/2 = (2, 3) then | ^2 1 is the number of elements 
in (1,2,3)(4,5,6) G , (1, 2, 3)(4, 5)(6, 7) G , (1, 2, 3, 4)(5, 6) G or (1,2,3,4,5) G with 1,2,3 in the 
same cycle. This number is 



n — 3 
3 



+ 3!(n - 3) 



24 



n — 3 
2 

4 



4! 
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n — 
2 

n — 3 
2 

- G 



n — 3 
4 



(26) 



(Note, the term ( ra 2 3 ) (™ 2 5 ) accounts for the two choices of a 3-cycle on {1, 2, 3} while avoiding 
duplication in the choice of two 2-cycles from the remaining n— 3 and n— 5 vertices, respectively.) 



When j/i = (1,2) and j/ 2 = (3,4) then \Z 2 \ is the number of elements in (1, 2, 3)(4, 5, 6) G , 
(1,2,3)(4,5)(6,7) G , (1,2,3,4,5) G , (1, 2, 3, 4)(5, 6) G or (1, 2)(3, 4)(5, 6)(7, 8) G in which 1, 2, and 
3, 4 appear in the same cycle (s). This number is 



\Z 2 \ = 4(n-2)(n-5) + 



+ 4!(n-4) + 



\ In— 4\ / n — 6 



4(n - 4) 
4. 

6 



n — 5 
2 

n - 4 
2 



n - 4 
3 

n - 4 
2 



+ 



= 24(n-4) + (n-5)(13n~44) + 14 



n - 4 
3 



71-4 

4 



(27) 



It is clear that Z = ZqU ZiU Z 2 is a disjoint union. Comparing ([2~4"| + (f2l)| to (f2"5j) + (|2"T|) one can 
check that the first expression is bigger than the second for all n > 4. Hence n Bs(y)\ takes 
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its maximum when y S (1, 2, 3) G for all n > 4. Therefore 

7V 2 (Sym„(T), 3) = |5 | + + \S 2 \ + (n + 2)(n - 3) + 

n4 fn — 3\ „„/ti — 3\ „(n— 3\ ,„„. 
+ 24^ 2 J+22^ 3 j +6 ( 4 j (28) 

and this complete the second part of the theorem. 

We now return to the comment at the beginning of this proof. Comparing (|23l) to (|2"8)l shows 
that A^(r, 3) > -/Vi(r, 3) for all n > 4. When y belongs to S3 or S4 a very rough upper 
bound for \B 3 n B^{y)\ can be obtained by following through the argument in Lemma [7] By 
Lemma S] the downward degree for a vertex in Sj(e) is at most ) while the upward degree 
is at most (™) — j- By considering the possible ascent-descent combinations of a path from y to 
a vertex in B 3 we can work out that N A {T, 3) < -455 + 155 Q) and N 3 (T, 3) < -132 + 65 (™). 
Using the same arguments we can bound Nq(T, 3) < 1575 and iVs(r, 3) < 525. Evaluating these 
inequalities it can be seen that iVa(r, 3) > Nj(T, 3) for j = 3, 4, 5, 6 from n > 16 onwards. We 
note that a better lower bound n > 10 can be obtained by a more careful albeit tedious count 
of the possible paths. This completes the proof. □ 



6.2 The asymptotic behaviour of iV(Sym n (T), r) 



The main work in this section will be to find JVi(Sym n (T),r) and A^2(Sym n (T), r) for arbitrary 
r and sufficiently large n. It will turn out that this determines N(Sym n (T),r) in general. Let 
b(n,r) denote the cardinality of the ball of radius r in Sym n (T), thus 

b(n,r) = \B r \= 2J s i( n ) = X! c(n,n-i) 

0<i<n 0<i<n 

in terms of the signless Stirling numbers of the first kind. 

First we consider A r 2(Sym„(T), r). By Lemma U we have N% (Sym„ (T) , r) = 6(n, r — 2) + 
|(SV U SV-i) H i? r (y*)| where y* suitably is a 3-cycle or a double transposition. We set A := 
\(S r U S'r-i) n £? r (y*)| and let y* = yiy 2 with two transpositions yi. We now need to find the 
number A of all z in SV U S r -\ which can be reached on a path t1t2--.tr* of length r* < r 
starting from y* . This means that z — y\y2^\^2---ir* and applying the inversion automorphism, 
see Theorem[4j we obtain the element — t r * ■ ■ ■ ^12/22/1 in S r US r —i. This represents a path 
from e = ea in which y 2 and y\ are the last edges. If Z denotes the set of all such elements 
z- 1 then \Z\ = A. 

Let u be in S r -\ and suppose that u = t r —x ■ ■ - t2t\ is the product of suitable transpositions 
t t . If ti = yi then u = (t r -i ■ ■ ■ £22/2)2/22/1 so that u E Z. Otherwise uyi = (t r -i ■ • • ^2*12/2)2/22/1 
belongs to Z. Thus every u in SV_i gives rise to one element in Z. The set of elements of this 
type is denoted by Zq, and in particular \Zq\ = |5 r _i|. 

The next type of vertices in Z are of the shape z~ x — t r * •••£2*12/22/1 where both z~ x and 
t r * ■ ■ - t2t\ belong to S r -\ while t r * ■ ■ • £2*12/2 belongs S r . This type will be called Z±, evidently 
this set is disjoint from Zq. 
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The remaining vertices in Z are of the shape z^ 1 — t r * • • • ^2^12/2^/1 where both and 
t r * ■■■t 2 t\ belong to S r while t r * ■■•t 2 t\ 7/2 belongs SV+i- These are the vertices of type Z 2 
and it follows that Z — Zq U Z\ U Z 2 is a disjoint union. Therefore 



N 2 (Sym n {T),r) = b{n,r-l) + ma X (\Z 1 \ + \Z 2 \ : y* e S 2 ) (29) 

where Z\ and Z 2 depend on the choice of y* as either a 3 -cycle or a double transposition. We 
can now prove the following theorem: 

Theorem 8 Let T = Sym„(T) be the transposition Cayley graph. Suppose that r > 2 and 
that y is a transposition. 

(i) For n — 1 > r we have 

ATi(T,r) = b(n,r-l) + \E r ^ r {y)\ = 

= 2 - (l-ffr-xl + |5 r _ 3 | + l-Sr-sl H ) . (30) 

(ii) For n sufficiently large we have 

N 2 (T, r) > iVi(r, r) . (31) 

Proof: (i) Let y be in Si. By Lemma Q] we have Ni(L,r) = \B r C\B r (y)\ = \B r _i\ + \S r nB r (y)\. 
Hence we need to find the number M{y) = \S r H B r (y)\ of all z e S r which can be reached on a 
path of length < r from y . Such a path is necessarily of the shape z — yt\t 2 ■ ■ ■ t r _i, consisting 
of r — 1 transpositions . Applying the inversion automorphism, as above, we obtain a path 
= t r —\ ■ ■ -t 2 t\y starting from e to z^ 1 £ S r with y as last edge. Evidently any vertex in 
SV-i can be reached by a suitable choice of t r -\ ■ ■ - t 2 t\ and therefore M(y) — \E r _i tr (y)\. The 
result now follows from (1221) . 

(ii) Let Z\ and Z 2 have the same meaning as in (|2"9"|) . By the first part of this theorem and 
the equation (|2"9"|) it will be sufficient to show that \Z 2 \ > \E r -i f7 .(y)\ for all large enough n. 
We evaluate \Z 2 \ for j/* = y\y 2 with yi = (1,2) and y 2 — (2,n). The vertices in Z 2 are in 
one-to-one correspondence with vertices u G S r +i such that wyi and uy 2 € S r . By the basic 
property (TT6l) this is the case if and only if l,2,n belong to the same cycle of u. Let U be the 
set of such elements, \U\ — \Z 2 \. To count \U\ consider elements u' in the symmetric group G' 
on {1, 2, n — 1} which have the following properties: (i) both 1 and 2 are in the same cycle 
of u' , and (ii) u' has (n — 1) — r cycles. Thus u' is in S r Pi G" and it follows from (|2"2"|) that the 
total number of such elements u' is 

a: = |£ r -i,r(2/i) H { {x, x*} : x, x* e G'} 

= \S r -l n G'\ - \S r -2 n G'\ + \S r - 3 D G'\ - ... + (-lf-^So H G'| . 

By Lemma[5] it follows that the coefficient of the leading power of n in |S r _i| and in \S r -i H G"| 
is the same. Therefore 

o=|S r _i, r (yi)n{{i,s*} : v'eC'lh |S r _i| + /(n) 
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for a polynomial f(n) of degree < 2(r— 1) — 1. Any v! of the kind just considered is a permutation 
on {1,2, ...n — 1} in which 1 and 2 appear in the same cycle, say of length c u < > 2. Now we 
may insert n into that cycle, in c u i distinct ways, to get c u i different elements in U. Thus 
\Z 2 \ = \U\ > 2a and therefore 

\Z 2 \>2-\Sr-i\+2f(n). (32) 

Since the leading term of \E r _i^ r (y)\ is |S r _i| by (j2"2"]l it follows that \Z 2 \ > \E r -i ir (y)\ for all 
large enough n. □ 



In the expression iV 2 (Sym n (T),r) = b(n,r - 1) + max(|Zi| + \Z 2 \ : y* £ S 2 ) stated in the 
term \Zi\ + \Z 2 \ depends on the choice of y*. We therefore turn to evaluating \Zi\ + \Z 2 \ for the 
two possible choices y* = (1,2,3) and y* = (1,2) (3, 4). 

This leads us to the following definition. Let C31 (n, n — i) be the number of vertices in Si in 
which the letters 1,2,3 appear in a single cycle, and let c 2 2 (n, n — i) be the number of vertices 
in Si in which the letters 1,2 and 3,4 appear in the same cycle or cycles. For instance, 

c 3 i(n,n) = C31 (n, n — 1) = 0, c 3 i (n, n — 2) = 2, 
c^i(n, n — 3) = (n + 2)(n— 3), and 

= 24(" 2 3 ) + 22(" 3 3 ) +6 (^ 3 ). (33) 



Similarly we have 

c 2 2(n,n) = c 2 2 (n, n — 1) = 0, c 2 2 (n, n — 2) = 1, 

c 2 2 (n, n - 3) = , and 

c 22 (n,n-4) = 24(n-4) + (n-5)(13n-44) 

For this see again ([M]) , (1231) , (|26p and (|27p . As we have already observed in the proof of the last 
theorem, the general rule (Til))) implies that Z\ and Z 2 in (12T)1) satisfy 

\Zi\ + \Zi\ = c a i(n,n — r) + c 3 i(n, n - (r + 1)) (35) 

when y* is a 3 -cycle and 

^1 1 + 1-^2 1 = c 2 2(n,n -r) + c 2 2(n,n- (r + 1)) (36) 

when y* is a double transposition. We obtain an estimate for C31 (n, n — r) and c 2 2 (n, n — r) as 
follows: 



Lemma 8 for fixed i > 2 a^c? n sufficiently large we have 

2 /n- 



, 2 /n-3\/n-5\ /n + 3 - 2i\ ,. 

c 3 i(n,n _,) = ___ ... +/! (37) 
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and 

1 (n - 4\ (n - 6\ (n + 2 - 2i\ 
)=(-^T( 2 J( 2 J-( 2 )+/» ( 38 ) 

where the fi are polynomials in n of degree < di — 2(i — 2). In particular, c 3 i(n,n — i) and 
c 2 2 (n, n — i) are polynomials of degree di and c 3 i (n, n — i) = 2 c 2 2 (n, n — i) + / 3 with a polynomial 
f 3 of degree < d t . 



Proof: Let C 3 i (n, n — i) C Si and C 2 2 (n, n — i) C Si be the sets counted by c 3 i (n, n — i) and 
€22(71, n — z) respectively. For i = 2. when 0! = 1, we see that C 3 i(n, n — 2) consists of the 
two 3-cycles (1,2,3) and (1,3,2) while C 2 2(n, n — 2) consists of the single double transposition 
(1, 2)(3, 4) only. This established the base of induction and accounts for the factor 2 throughout. 
We will prove the statement ([3T1) concerning c 3 i (n, n — i), the corresponding statement (|3"5|) for 
c 2 2 (n, n — i) follows in exactly the same way. 



If g G Sym n let supp(g) be its support, that is all symbols moved by g . The cardinality of the 
support of any g £ C 3 i(n,n-i) is at most 3+2(i-2). Let C n := C31 (n, n-i) n(l n ^ 2l+1 2^ 2 3 1 ) G 
and let C| = C 3 i (n, n - i) \ C^. Then 

. _,• , 2 (n — 3\ /n — 5\ f n + 3 — 2i 

C l n = 



y °' (i - 2)1 V 2 J V 2 y V 2 
and by induction we assume that \C\ \ — fi has degree < di . Since 

. _,. , , . 2 (n — 3\ /n — 5\ /n + 3 — 2i\ /n + 1 — 2i 



y °' — 1)! v 2 y v 2 y v 2 / v 2 

it remains to show that the number of elements in C{ +1 is a polynomial of degree at most d, + 1. 

By considering cycle types it is easy to see that any vertex in C{ +1 has at least one neighbour 
in Cg or in C\. In the first case, if g = u ■ (j'1,.72) with u 6 CJ then at least one of {ji, j 2 } 
must be in the support of g as otherwise g € Cq +1 . The number of such elements g therefore 
is polynomial of degree at most di + 1. The number of vertices of the second kind is clearly at 
most /i( 2 ), again of degree at most di + 1. Hence c 3 i(n, n — i) has the required expression. In 
the case of c 2 2 (n, n — i) the same arguments apply. □ 



Theorem 9 Let T = Sym n (T) be the transposition Cayley graph and suppose that r > 1. 
Then for all sufficiently large n we have 

N(T, r) = N 2 (T,r) 

= b(n, r — 1) + C31 (n, n — r) + c 3 i (n, n — (r + 1)) . (39) 



Remark: For r < 3 we already have computed the value of iV(Sym n (T), r) in Theorems [SJ [6] 
and [7] when N(Sym n (T),r) indeed agrees with (|3"9")l . In these theorems the lower bound on n 
was explicit and hence better than the condition here. Nevertheless, analysing the arguments 
here it is likely that the bound n > 3r is sufficient. 

Proof: We can assume that r > 3. By (J23), ([35]) and Lemma El it follow that N 2 (T,r) = 
b(n,r — 1) + c 3 i(n, n — r) + c 3 i(n,n — (r + 1)) and this establishes the second equation. By 
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Theorem [8] we know that iV2(Sym n (T), r) > Ni(Sym n (T),r) and both terms are polynomial of 
degree 2(r — 1). It remains to show that N s (Sym n (T) , r) is polynomial of degree < 2(r — 1) for 
2 < s. For y £ S s consider all paths z = yt\ ■ ■ ■ t r * of length r* < r to a vertex z S 5 S * with 
s* < r. If a and & are the number of ascents and descents then a + b = r* < r and a — b = s* — s. 
From this it follows that a < r — 1 and the required fact now follows from Lemma [7] □ 
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